#Image Generation

63 articles

Tech · Apr 27, 2026 · 7 min

LLaDA2.0-Uni Is an Open-Weight Diffusion LLM That Unifies Image Understanding and Generation

Inclusion AI released LLaDA2.0-Uni, a 16B MoE diffusion LLM that handles image understanding, 1024px image generation, image editing, and interleaved text-image generation in a single model.

AI LLM Image Generation VLM MoE Open Model Multimodal

Tech · Apr 26, 2026 · 7 min

Qwen-Image-2.0-Pro Looks Like an API-Side Upgrade for Now

Checked the 2026-04-22 snapshot of Qwen-Image-2.0-Pro. It ranks 9th overall on Arena Text-to-Image, but there are no official open weights on Hugging Face yet.

Qwen Image Generation AI LM Arena

Tech · Apr 25, 2026 · updated · 20 min

WAI-Anima character LoRA on RunPod with AnimaLoraToolkit: $1.22 to train, but ponytail direction won't follow prompts

Trained a WAI-Anima character LoRA on RunPod (AnimaLoraToolkit + sd-scripts) for $1.22, but at inference the side-ponytail direction won't shift with Danbooru tags or natural language — a directional bias from Anima base. Full verification record.

LoRA AI Image Generation Anima WAI-Anima RunPod ComfyUI Qwen Experiment

Tech · Apr 24, 2026 · updated · 14 min

WAI-Illustrious v17 hands-on: hires fix auto-corrects hands and feet, 4 rating tags, do v16 LoRAs still work?

WAI-Illustrious SDXL v17 tested on M1 Max 64GB ComfyUI against v16 with the same seed. Hires fix now auto-corrects hands and feet, the four rating tags (general/sensitive/nsfw/explicit) still drive NSFW output, and v16-trained LoRAs mostly carry over — with one case where they don't.

AI Image Generation ComfyUI Stable Diffusion LoRA Apple Silicon Experiment

Tech · Apr 17, 2026 · 10 min

Testing Z-Image i2i for Pixel Art Conversion

Z-Image has its own pixel art LoRAs, but can they actually convert photos to pixel art via i2i? Tested Z-Image Turbo and the base model, and compared both with Illustrious on M1 Max 64GB.

Z-Image Image Generation Apple Silicon Experiment

Tech · Apr 17, 2026 · updated · 9 min

WAI-Anima v1 on RTX 4060 Laptop (8GB) via ComfyUI API: 55s/image and the tqdm OSError fix

Tested WAI-Anima v1 on Windows + RTX 4060 Laptop GPU (8GB VRAM). Headless execution via ComfyUI API hit a tqdm OSError on startup, but launching ComfyUI normally generates a single image in 55 seconds. Includes the workaround and timing notes.

AI Image Generation ComfyUI Windows NVIDIA Stable Diffusion LoRA Experiment Anima WAI-Anima

Tech · Apr 16, 2026 · updated · 13 min

WAI-Anima v1 vs WAI-Illustrious on M1 Max ComfyUI: brings Anima's atmospheric backgrounds but loses on tag control and character consistency

Tested WAI-Anima v1, Anima preview3-base, and WAI-Illustrious v160 side by side on M1 Max 64GB ComfyUI with same seed/prompt. WAI-Anima inherits Anima's atmospheric lighting and natural running poses but still loses to WAI-Illustrious on tag control and character consistency. Includes i2i pipeline test (denoise 0.5), ~275s generation times, and how the Anima derivative ecosystem (WAI-Anima, CottonAnima, Kirazuri, RDBT) expanded in two months.

AI Image Generation ComfyUI Qwen Apple Silicon Stable Diffusion LoRA Experiment Anima WAI-Anima

Tech · Apr 14, 2026 · 10 min

Can Qwen Image Edit Convert Photos to Pixel Art?

Tested 5 approaches including Qwen Image Edit, JS color reduction, and Illustrious i2i + LoRA. Illustrious i2i alone turned out to be the fastest and lightest solution for pixel art conversion.

Qwen Image Generation Apple Silicon Experiment

Tech · Apr 1, 2026 · updated · 14 min

See-through anime PSD test: 23-layer character decomposition with LayerDiff and Marigold

Testing See-through for anime character PSD decomposition: 23 generated layers, front/back hair separation, hidden-area inpainting, and what LayerDiff + Marigold actually produced from a single illustration.

Experiment AI Anime Image Generation Live2D RunPod SDXL ComfyUI

Tech · Mar 22, 2026 · 6 min

Luma AI's Uni-1 unifies understanding and generation in a single Transformer

Luma AI's Uni-1 integrates image understanding and generation in one decoder-only autoregressive model. It does not use diffusion; instead, it tokenizes text and image patches in a shared vocabulary and generates them sequentially.

AI Image Generation Luma AI Transformer

Tech · Mar 18, 2026 · updated · 8 min

ComfyUI on Blackwell GPUs (RTX 5090 / RTX PRO 6000): why sm_120 fails and the PyTorch Nightly fix that works

Why ComfyUI breaks on NVIDIA Blackwell (sm_120) GPUs with 'no kernel image is available for execution' errors, and a working setup using PyTorch Nightly, xformers removal, SageAttention, and NVFP4 quantization. Tested on RTX PRO 6000 Blackwell.

ComfyUI NVIDIA GPU Blackwell Image Generation

Tech · Mar 5, 2026 · updated · 20 min

Testing Live2D Face-Part Separation with Qwen-Image-Layered on RunPod

Uses tori29umai’s LoRA to automatically split facial parts, with results from batching 28 images and a log of the limits hit when attempting finer hair separation.

RunPod Qwen diffusers Image Generation LoRA Live2D Experiment